Picture for Bo An

Bo An

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Add code
Feb 09, 2026
Viaarxiv icon

Conditional Performance Guarantee for Large Reasoning Models

Add code
Jan 30, 2026
Viaarxiv icon

Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping

Add code
Jan 24, 2026
Viaarxiv icon

History Is Not Enough: An Adaptive Dataflow System for Financial Time-Series Synthesis

Add code
Jan 15, 2026
Viaarxiv icon

Bayesian Robust Financial Trading with Adversarial Synthetic Market Data

Add code
Jan 14, 2026
Viaarxiv icon

Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation

Add code
Jan 12, 2026
Viaarxiv icon

AgentOCR: Reimagining Agent History via Optical Self-Compression

Add code
Jan 08, 2026
Viaarxiv icon

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Add code
Jan 04, 2026
Viaarxiv icon

C2PO: Diagnosing and Disentangling Bias Shortcuts in LLMs

Add code
Dec 29, 2025
Viaarxiv icon

FineFT: Efficient and Risk-Aware Ensemble Reinforcement Learning for Futures Trading

Add code
Dec 29, 2025
Viaarxiv icon